Chao Hao

GUI-C$^2$: Coarse-to-Fine GUI Grounding via Difficulty-Aware Reinforcement Learning

May 29, 2026

Seg-Agent: Test-Time Multimodal Reasoning for Training-Free Language-Guided Segmentation

May 13, 2026

YUV20K: A Complexity-Driven Benchmark and Trajectory-Aware Alignment Model for Video Camouflaged Object Detection

Apr 11, 2026

Dynamic Collaboration of Multi-Language Models based on Minimal Complete Semantic Units

Aug 26, 2025

Distribution-Specific Learning for Joint Salient and Camouflaged Object Detection

Aug 08, 2025

Uncertainty-Aware GUI Agent: Adaptive Perception through Component Recommendation and Human-in-the-Loop Refinement

Aug 06, 2025

LangTime: A Language-Guided Unified Model for Time Series Forecasting with Proximal Policy Optimization

Mar 11, 2025

TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model

Aug 22, 2024

From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation

Aug 05, 2024

A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection

Feb 29, 2024